ISSS608 G1 Group 5


Chen Yuxi 1

Lim Yong Kai 1

Jovinka Hartanto 1

1 Singapore Management University

Introduction

The VAST 2021 Mini Challenge 2 outlines a hypothetical scenario in which several GAStech employees have gone missing and the organisation, Protectors of Kronos (POK), is suspected of being involved. The challenge requires identification of suspicious activities hidden in data and determine any dubious people and locations that should be reported to the law enforcement using interactive visual analytical tools.

Motivation and Objectives

Our research and development effort aims to enable Kronos law enforcement with the ability to easily analyse, drill down and identify key suspects and suspicious locations, and thereby speeding up the investigation process.

The use cases of the data visualisation tool include but are not limited to the following:

  • The most popular locations and when they are popular
  • Infer the owner of each credit card and loyalty card
  • Identify potential informal or unofficial relationships among GASTech personnel
  • Analyze unusual and suspicious activities of the employees
  • Analyze daily routines of GasTech employees

Approach and Methodology

The application aims to provide users actionable insights based on the following analyses:

  • Exploratory Data Analysis (EDA) of GPS tracking data and credit and loyalty card transaction patterns
  • Visualisation of employee movements over time and associated purchase transactions
  • Network analysis of employees based on their visited locations

Application

The STAGEM Application was designed using R Shiny and hosted on shinyapps.io server. The application has 5 main section with various interactive visual analytics tools to allow users to investigate the data. The user guide to the application is available at https://isss608g1group5.netlify.app/userguide.html.

Abila Kronos Map

The main tab shows the GPS movement data of each employee’s car. Figure 1 shows Abila map with each employee’s car GPS movement data and stop locations.
Abila Map

Figure 1: Abila Map

Figure 2 is a scatter plot that gives an overview of the cars that visited the specific locations during different time period of the day.
Scatter Plot of Locations visited

Figure 2: Scatter Plot of Locations visited

Locations Transactions

The main tab of Location Transactions allow users to investigate GASTech employee’s credit card transactions at the various locations made in Abila. There are three interactive visualisation that shares five filters where users can select by location, employee, date, time period of transaction and department of each employee to dynamically update the visualisations.

Heatmap of Location Visit Frequency

Figure 3 provides an overview of the frequency of transactions at each location by date. Hovering over each region will show the micro-data of frequency count for that location by date.
Heatmap

Figure 3: Heatmap

Boxplot of Credit Card Transactions

Figure 4 displays each credit card transaction price at varying locations. Hovering over the boxplot will display the micro-data of the median price for the location and the corresponding credit card number and transaction price.
Boxplot

Figure 4: Boxplot

Barplot of Number of Transactions per Day per Location

Figure 5 displays the number of transactions for each location by date and colored by time period. Hovering over the columns displays the micro-data of the number of transactions performed at the location on the date during the time period.
Stacked Barplot

Figure 5: Stacked Barplot

Card Mapping

Figure 6 displays the credit card and loyalty card mapping to each GASTech employee with reference to their car GPS data. Users can filter different credit card number to dynamic change the plot to observe the respective transactions made at the different locations. The difference in price of each matching credit card and loyalty card is also made available in the plot. Users can utilise the interactive plot to highlight different selections throughout the parallel coordinate plot.
Parallel Coordinate Plot

Figure 6: Parallel Coordinate Plot

Network Analysis

Network analysis depicts the movement of GASTech employees to corresponding locations. There are two network analysis presented based on their credit card transaction history or on their car GPS movement and locations. Figure 7 displays the network analysis based on credit card transactions. Each node represents an employee or location and each edge represents the connection between each nodes.
Network Analysis from Credit Card Transactions

Figure 7: Network Analysis from Credit Card Transactions

Future Work

Future work would have to be done to integrate the different visualizations developed as the current plots are loosely coupled. One area of improvement would be to have pop-up charts upon clicking of data points on existing visualizations to showcase extended information that would be of relevance.

The scope of STAGEM could be extended to cover Mini-challenges 1 and 3 to provide a comprehensive integrated visual analytics system that helps to resolve the overarching case scenario. Additional analyses relating to mini-challenges 1 and 3 such as text analytics and sentiment analysis can help to complement findings from the existing application to provide a much more cohesive narrative to the storyline.

Conclusion

Through the utilisation of interactive visual analytical tools in the Shiny application, users can visualise patterns and anomaly activities of the employees. The user-friendly layout incorporated with drop down filters allow users to slice the data and utilise the interactivity of each visualization to investigate the data without knowledge of programming or data analytics.

Acknowledgement

We would like to thank Professor Kam Tin Seong for his guidance throughout the tenure of the project.

STAGEM: Saptiotemporal Analysis of Gastech Employees’ Movement Data